Integrating Fault-Tolerant Feature into TOPAS Parallel Programming Environment for Distributed Systems

نویسندگان

  • Giang T. Nguyen
  • Viet D. Tran
  • Margaréta Kotocová
چکیده

In this paper, TOPAS1 a new parallel programming environment for distributed systems is presented. TOPAS automatically analyzes data dependence among tasks and synchronizes data, which reduces the time needed for parallel program developments. TOPAS also provides supports for scheduling, dynamic load balancing and fault tolerance. Experiments show simplicity and efficiency of parallel programming in TOPAS environment with fault-tolerant integration, which provides graceful performance degradation and quick reconfiguration time for application recovery.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Application Recovery in Parallel Programming Environment

In this paper, fault-tolerant feature of TOPAS parallel programming environment for distributed systems is presented. TOPAS automatically analyzes data dependence among tasks and synchronizes data, which reduces the time needed for parallel program developments. TOPAS also provides supports for scheduling, load balancing and fault tolerance. The main topics of this paper is to present the solut...

متن کامل

Distributed and fault-tolerant thread management for parallel computations

Application of distributed computations becomes more attractive with the advances in computer networks. Lemick is an environment for automation of real-time distributed programming expressed in the terms of more traditional multithreaded programming. This paper gives a brief introduction into the implementation of Lemick distributed threads. A focus is made on the mechanisms employed to support...

متن کامل

Fault-tolerant Distributed Applications In LiPS

Performing computations using networks of workstations is increasingly becoming an alternative to using a supercomputer. This approach is motivated by the the vast quantities of unused idle-time available in workstation networks. Unlike computing on a tightly coupled parallel computer, where a xed number of processor nodes is used within a computation, the number of useable nodes in a workstati...

متن کامل

A Model for Adaptive Fault-Tolerant Systems

An adaptive computing system is one that modiies its behavior based on changes in the environment. Since one common type of environment change in a distributed system is network or processor failure, fault-tolerant distributed systems can be viewed as an important subclass of adaptive systems. As such, use of adaptive methods for dealing with failures in this context has the same potential adva...

متن کامل

Fault Tolerance Lessons Applied to Parallel Computing

This paper describes an approach to fault-tolerant parallel computing which is based on the experiences with the most successful fault-tolerant software – the transaction processing systems. The algorithms presented here have less runtime overhead and faster recovery than most preceding approaches. In the Pact parallel programming environment fault tolerance is provided fully user transparent i...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2002